Searching for Supermaximal Repeats in Large DNA Sequences
Abstract
We study the problem of finding supermaximal repeats in large DNA sequences. For this, we propose an algorithm called SMR, which uses an auxiliary index structure (POL) that is derived from, and replaces, the suffix tree index STTD64 [1]. The results of our numerous experiments on data from the 24 human chromosomes indicate that SMR outperforms the solution provided as part of the Vmatch [2] software tool. In searching for supermaximal repeats of length at least 10 bases, SMR is twice as fast as Vmatch; for a minimum length of 25 bases, SMR is 7 times faster; and for repeats of length at least 200 bases, SMR is about 9 times faster. We also study the cost of POL in terms of time and space requirements.
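To make the object of the search concrete, the following is a minimal brute-force sketch of what a supermaximal repeat is. It is not the SMR algorithm and uses neither POL nor a suffix tree. It assumes the standard definition (a substring that occurs at least twice and is not a proper substring of any other repeated substring) and counts overlapping occurrences. The function name `supermaximal_repeats` and the parameter `min_len` are hypothetical names chosen for illustration. It enumerates all substrings, so it is only suitable for very short sequences.

```python
from collections import defaultdict


def supermaximal_repeats(seq: str, min_len: int = 1) -> dict[str, list[int]]:
    """Brute-force illustration only (assumed definition, not the paper's SMR).

    Returns a mapping from each supermaximal repeat of length >= min_len
    to the sorted list of its 0-based start positions in seq.
    """
    # Record the start positions of every substring of seq.
    positions: dict[str, list[int]] = defaultdict(list)
    n = len(seq)
    for i in range(n):
        for j in range(i + 1, n + 1):
            positions[seq[i:j]].append(i)

    # A repeat is any substring with at least two (possibly overlapping) occurrences.
    repeats = {w for w, starts in positions.items() if len(starts) >= 2}
    alphabet = set(seq)

    result: dict[str, list[int]] = {}
    for w in repeats:
        if len(w) < min_len:
            continue
        # Every substring of a repeat is itself a repeat, so w lies inside a
        # longer repeat exactly when some one-character extension of w repeats.
        if any((c + w) in repeats or (w + c) in repeats for c in alphabet):
            continue
        result[w] = positions[w]
    return result


if __name__ == "__main__":
    for repeat, starts in sorted(supermaximal_repeats("ACGTACGT").items()):
        print(repeat, starts)
```

The `min_len` filter mirrors the minimum repeat lengths (10, 25, 200 bases) used in the experiments described above; a practical implementation would replace the quadratic substring enumeration with an index such as a suffix tree or suffix array.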
Similar resources
Fast Computation of Supermaximal Repeats in DNA Sequences
RepMaestro: scalable repeat detection on disk-based genome sequences
MOTIVATION We investigate the problem of exact repeat detection on large genomic sequences. Most existing approaches based on suffix trees and suffix arrays (SAs) are limited either to small sequences or those that are memory resident. We introduce RepMaestro, a software that adapts existing in-memory-enhanced SA algorithms to enable them to scale efficiently to large sequences that are disk re...
Searching the genome of beluga (Huso huso) for sex markers based on targeted Bulked Segregant Analysis (BSA)
In sturgeon aquaculture, where the main purpose is caviar production, a reliable method is needed to separate fish according to gender. Currently, due to the lack of external sexual dimorphism, the fish are sexed by an invasive surgical examination of the gonads. Development of a non-invasive procedure for sexing fish based on genetic markers is of special interest. In the present study we empl...
Simple Sequence Repeats Amplification: a Tool to Survey the Genetic Background of Olive Oils
A reliable DNA extraction method for use on extra virgin olive oil based on a commercial kit was defined, and the possibility of using this DNA for fingerprinting the original cultivar was demonstrated. The genetic traceability of single-cultivar virgin olive oil from two cultivars (Carolea and Frantoio) was achieved by identifying the varieties from which they were produced. This involved the ...
Publication date: 2008